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i!B ■ ABSTRACT 

I ' We present new near-infrared images oi z > 0.8 radio galaxies from the flux-limited 7C- 

O \ III sample of radio sources for which we have recently obtained almost complete spectroscopic 

-4— » ' redshifts. The 7C objects have radio luminosities « 20 times fainter than 3C radio galaxies at a 

^ \ given redshift. The absolute magnitudes of the underlying host galaxies and their scale sizes are 

only weakly dependent on radio luminosity. Radio galaxy hosts at z ~ 2 are significantly brighter 
. ^ , than the hosts of radio-quiet quasars at similar redshifts and the model AGN hosts of Kauffmann 

' & Haehnelt (2000). There is no evidence for strong evolution in scale size, which shows a large 

H \ scatter at all redshifts. The hosts brighten significantly with redshift, consistent with the passive 

evolution of a stellar population that formed at z > 3. This scenario is consistent with studies 
of host galaxy morphology and submillimeter continuum emission, both of which show strong 
evolution at z > 2.5. The lack of a strong "redshift cutoff" in the radio luminosity function to 
z > 4 suggests that the formation epoch of the radio galaxy host population lasts 1 Gyr from 
z > 5 to z ~ 3. We suggest these facts are best explained by models in which the most massive 
galaxies and their associated AGN form early due to high baryon densities in the centres of their 
dark matter haloes. 

Subject headings: galaxies: evolution - galaxies: active - galaxies: formation 

1. Introduction lO^^WHz"^ are associated almost exclusively with 

giant elliptical host galaxies. If this continues to 
At low redshifts, FRI and FRII radio sources ^e the case out to high redshift, then radio galax- 

with radio luminosities at 151 MHz of ^^(151) ^ ies can give us a unique insight into the forma- 

' tion and evolution of a single class of massive 

^Current address: Institute of Astronomy, Madingley , ... .11 •;■ • 1 t i ; 

Road, Cambridge, CBS OHA, UK galaxy. This IS particularly exciting m the light 
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of submillimetre detections of z ^ 4 radio galax- 
ies (e.g. Archibald et al. 2000) which may indi- 
cate that we can see these objects during their 
major bursts of star formation [although sec also 
Willott, Rawlings & Jarvis (2000)]. Furthermore, 
the similarity of radio galaxy hosts, in contrast to 
the wide range in luminosity of radio-quiet quasar 
hosts (McLure et al. 1999; Ridgway et al. 2000), 
suggests that one of the conditions necessary for 
producing powerful radio jets is the presence of a 
massive spheroidal component, and therefore a su- 
permassive (^ IO^Mq) black hole (Lacy, Ridgway 
& Trentham 2000). 

Studies of high redshift radio galaxy hosts have 
traditionally concentrated on the K — z Hubble 
Diagram. Work on the 3C and 1 Jy samples by 
Lilly (1989 and rcfs. therein) initially pointed to a 
passively-evolving stellar host formed at high red- 
shift which evolved into the giant elliptical radio 
galaxy hosts seen today, but this was challenged 
when some high redshift radio galaxies were found 
to have significant emission line contributions to 
their K-hand Hght (Bales & Rawlings 1993, 1996). 
Only by finding low AGN-luminosity radio galax- 
ies at high redshift could the controversy be re- 
solved. Bales et al. (1997) used the 6C sample, 
a factor of five fainter in radio flux than the 3C 
sample, to show that there did seem to be a radio 
luminosity dependence of host galaxy magnitude. 
Further work by Roche, Bales & Rawlings (1998) 
indicated that the hosts of 6C radio galaxies were 
not only significantly fainter than their 3C coun- 
terparts, but also had smaller scale sizes. 

Wc have used the 7C-III radio galaxy redshift 
survey of Lacy et al. (1999b) to select a complete 
sample of z > 0.8 radio galaxies. The 7C redshift 
surveys are a factor of 4 5 lower still in luminosity 
at a given redshift than the 6C sample of Bales et 
al. (1997), and thus allow the study of high redshift 
radio galaxy hosts over a wide range in radio lu- 
minosity (Willott et al. 1999). The 7C-III sources 
were imaged in the near-infrared on the 3-m NASA 
Infrared Telescope Facility (IRTF) and the 3-m 
Shane Telescope at Lick Observatory. These data 
have allowed us to further investigate the radio lu- 
minosity dependence of host properties at z ^ I 
and, because our 7C objects at z ~ 2 have simi- 
lar radio luminosities to z ~ 1 6C radio galaxies 
and z ~ 0.3 3C radio galaxies, we can also inves- 
tigate host galaxy evolution over a wide range in 



redshift. 

We assume a cosmology with f^M = 1, f^A = 
and Ho = 50kms~^Mpc~^ except where otherwise 
stated. 

2. Observations 

Most sources were observed on the IRTF with 
NSFCAM, a near infrared imaging camera em- 
ploying a 256 X 256 InSb array, on the nights of 
1999 July 28 - 29 UT. The 0.3 arcsec/pixel scale 
was used for all observations. Details of the ob- 
servations are given in Table 1. One of the J, 
H or K filters was chosen for each object so as 
to cover as far as possible the rest-frame i?-band, 
thus minimizing the fc-correction and the associ- 
ated uncertainties. In practice this meant that ob- 
jects with 0.8 < z < 1.2 were observed in J-band, 
those with 1.2 < z < 1.8 in iJ-band and those 
with z > 1.8 in K-h&nd (using a K filter). Bxcep- 
tions to this were two objects with uncertain red- 
shifts, 70 1756-^6520 and 70 1804-^6313, which 
were both observed in K for ease of estimating 
photometric redshifts. The sky was clear through- 
out the run, and most objects were observed in 
conditions of sub-arcsecond seeing. Two more ob- 
jects with uncertain redshifts were observed using 
the Oemini instrument (McLean et al. 1993, 1994) 
on the Shane Telescope at Lick Observatory on the 
nights of 1999 June 1 and 4 UT. Gemini was used 
with a dichroic beamsplitter which enabled us to 
image in J and K simultaneously. The detec- 
tor on the short wavelength arm was a 256 x 256 
HgOdTe array, and that on the long wavelength 
arm a 256 x 256 InSb array. The image scale in 
both arms was 0.68 arcsec/pixel. 70 1814-1-6704 
was observed in both runs, but the better seeing 
of the IRTF data allowed more accurate photome- 
try in the moderately crowded field so only the 
IRTF data is presented. In addition a K im- 
age of 70 1745-1-6624 was presented in Lacy et 
al. (1999b). 

The data were reduced using the dimsum pack- 
age in IRAF, and fiat-fielded using dome fiats. The 
final images were magnified by a factor of two be- 
fore combination to improve the sampling of the 
final image. The reduced images are shown in Fig. 
1 and the photometric properties detailed in Table 
2. 
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Table 1 
Observing log 



Object 


Telescope 


z 


Filter 


iint/min 


FWHM PSF/" 


7C 1733+6719 


IRTF 


1.84 


k' 


18 


0.95 


7C 1740+6640 


IRTF 


2.10 


k' 


18 


0.65 


70 1741+6704 


IRTF 


1.05 


J 


13.5 


0.65 


70 1742+6346 


IRTF 


1.27 


H 


18 


0.90 


70 1748+6703 


Shane 


? 


J kK' 


56 


1.3 


70 1748+6657 


IRTF 


1.05 


J 


13.5 


0.90 


70 1751+6809 


IRTF 


1.54 


H 


18 


0.56 


70 1753+6311 


Shane 


1.96? 


J k k' 


82 


1.5 


70 1754+6420 


IRTF 


1.09 


J 


13.5 


1.07 


70 1756+6520 


IRTF 


1.48? 


k' 


18 


0.78 


70 1758+6719 


IRTF 


2.70 


k' 


82 


0.42 


70 1802+6456 


IRTF 


2.11 


k' 


18 


0.74 


70 1804+6313 


IRTF 


? 


k' 


18 


0.80 


70 1805+6332 


IRTF 


1.84 


K' 


36 


0.63 


70 1807+6841 


IRTF 


0.82 


J 


13.5 


0.72 


70 1807+6719 


IRTF 


2.78 


k' 


36 


0.75 


70 1812+6814 


IRTF 


1.08 


J 


13.5 


0.93 


70 1814+6702 


IRTF 


4.05? 


K' 


54 


0.78 




Shane 




k' 


54 


1.3 


70 1814+6529 


IRTF 


0.96 


J 


13.5 


0.84 


70 1820+6657 


IRTF 


2.98 


k' 


36 


0.69 


70 1816+6605 


IRTF 


0.92 


J 


13.5 


1.02 


70 1825+6602 


IRTF 


2.38 


K' 


18 


0.81 
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7C1733+6719 (if') 7C1740+6640 (if') 7C1741+6704 (J) 





7C1751+6809 (H) 7C1753+6311 (if') 7C1754+6420 (J) 





Fig. 1. — Near-infrared images of the z > 0.8 7C sources. The position of each identification is indicated by 
the dotted circle. The images are « 30 arcsec on a side. 
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7C1804+6313 (/-r') 7C1805+6332 (if') 7C1807+6719 (if') 




7C1807+6841 (J) 7C1812+6814 (J) 7C1814+6529 (J) 




7C1814+6702 {K') 7C1816+6605 (J) 7C1820+6657 {K') 




7C1825+6602 (K') 




Fig. 1. — continued 
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3. Analysis 



3.1. Magnitudes and fc-corrections 

Aperture magnitudes were measured as listed 
in Table 2 and corrected to a standard 63.9 kpc 
metric aperture according to the prescription of 
Eales et al. (1997), which assumes a power-law 
curve of growth of the form /(< r) oc r'^-^^ for 
z > 0.6 objects. Our metric aperture corresponds 
to about 8 circsGC &t z ^ 1 in our assumed cos- 
mology, and this angular size is not strongly de- 
pendent on the choice of f^M and For many 
of our objects we have been able to measure the 
magnitude in a large aperture, but for some we 
have had to restrict the aperture to w 3 arcsec to 
avoid contamination from neighbouring objects or 
excessive noise in the case of faint objects. The 
aperture correction applied in these cases is w 0.4 
magnitudes. The disadvantage of this technique is 
that the aperture magnitude and in particular the 
correction are scale-size dependent. An alterna- 
tive would have been to use total magnitudes (cf. 
Roche et al. 1998). However, we decided against 
this as for low signal-to-noise detections the curve 
of growth is ill-defined and can lead to large er- 
rors in the magnitudes (Stevens 1999). The mean 
half-light radius of our objects is 0.9 arcsec, so 
the typical amount of flux missed in an 8 arcsec 
aperture will be very small in any case. We thus 
believe that the use of these large aperture mag- 
nitudes will not seriously affect the results of this 
paper. 

For objects with spectroscopic redshifts, k- 
corrections to rest-frame i?-band were made using 
IGyr-burst models generated using the PEGASE 
code (Fioc & Rocca-Volmerange 1997), integrat- 
ing over the appropriate filter profiles. A model 
galaxy with a total age of 2 Gyr was used for ob- 
jects with 2 < z < 3, and 3 Gyr for objects with 
0.8 < 2; < 2. The z>3 objects used a 1.4 Gyr-old 
model galaxy. For most objects the fc-corrections 
are small (< 0.2 magnitudes), the exceptions to 
this being the z > 3 objects for which K cor- 
responds to rest-frame wavelengths significantly 
shorter than i?-band (but still above 4000A) and 
7C 1756-1-6520 which was observed in K to check 
its redshift photometrically. The biggest correc- 
tion was —0.61 mag. for 7G 1814-1-6704 at a prob- 
able redshift of 4.05. 
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Table 2 

Photometry and scale sizes of the 7C-in radio galaxies 
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7C 


1758+6719 


8 


k' 


= 19.3 + 0.2 


-24.5 


0.35 


0.12-1.0 


2.6 





7C 


1802+6456 


8 


k' 


= 19.3 + 0.3 


-24.1 


0.97 


0.15-7.0 


7.9 


50 


7C 


1804+6313 


3 


k' 


= 19.1 + 0.2 










? 


7C 


1805+6332 


5 


k' 


= 19.0 + 0.2 


-23.5 


1.9 


0.52-7.1 


16 





7C 


1807+6719 


5 


k' 


= 20.2 + 0.3 


-23.9 











7C 


1807+6841 


8 


J = 


= 18.8 + 0.2 


-23.7 


0.87 


0.54-1.4 


7.2 


6 


7C 


1812+6814 


5 


J = 


= 20.4 + 0.3 


-22.9 


1.1 


0.17-10 


9.4 


5 


7C 


1814+6702 


5 


K' 


= 19.4 + 0.2 


-25.7 











7C 


1814+6529 


8 


J = 


= 19.2 + 0.15 


-23.9 


0.89 


0.35-2.3 


7.6 


<3 


7C 


1816+6605 


5 


J = 


= 18.6 + 0.15 


-24.4 


0.79 


0.56-1.1 


6.7 


1 


7C 


1816+6710 


5 


H 


= 19.0 + 0.3 













7C 


1820+6657 


3 


k' 


> 20.7 


>-23.7 








>5 


7C 


1825+6602 


5 


k' 


= 19.0 + 0.2 


-24.7 


0.59 


0.56-0.63 


4.6 


10 



Note. — Errors on the magnitudes are estimated assuming a photometric error of 0.1 mag- 
nitude combined with noise. ^ is the aperture diameter and Mr is measured in a 63.9 kpc 
metric aperture. The limit on the magnitude of 7C 1820+6657 is a 3-cr hmit. 
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Fig. 2. — An example of the model fitting procedure to the galaxy 7C 1758+6719. The galaxy itself, the 
residual of the galaxy minus the best fit model (rhi = 0.35 arcsec) and the galaxy minus the approximate 
±1(7 bounds on rhi (0.12 and 1.0 arcsec) are shown, running from left to right. All images have the same 
symmetric greyscale stretch, with black corresponding to a surface brightness of K = 20.8mag arcsec"^, 
and the subtractions have been arranged such that each residual image contains zero net flux. Thus a 
model with too small a scale size results in a compact negative residual near the centre of the image where 
oversubtraction has occurred, and a model with too large a scale size leaves a compact positive residual 
near the centre. The flux in the compact residuals left after subtraction of the ±lcr models, measured in a 
2-arcsec diameter aperture, are ±0.2 times the total galaxy flux in the same aperture. 
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3.2. Photometric redshifts for objects in 

the 7C-III sample 

Several of the objects observed had uncer- 
tain redshifts [grade 7 in Lacy et al. (1999b)] 
or no rcdshift information at all. Of the objects 
without spectroscopic redshifts, 7C 1748+6703 
has a K magnitude and colours consistent 
with 2.4 < z < 4 whereas 7C 1753+6311 and 
7C 1804+6313 both have magnitudes and colours 
consistent with them having redshifts in the range 
1.2 < z < 1.8 (Willott ct al. 2000). In addition 
Lacy et al. (1999b) estimated a photometric red- 
shift of 1.7 for 7C 1743+6341. The identification 
of 7C 1753+6311 in Lacy ct al. (1999b), which 
had a provisional redshift of 1.96, actually cor- 
responds to a blue galaxy along the radio axis. 
Our Lick image showed that the true identifica- 
tion is a very red {R — K > 5) galaxy 4.7 arcsec 
away which is significantly closer to the likely ra- 
dio central component. Nevertheless, the original 
identification may well be associated with the ra- 
dio galaxy, and the magnitude and colours are 
consistent with z < 2, so we have not revised our 
redshift estimate. As we aligned the slit with the 
radio axis, though, we might have expected to see 
Lya in the spectrum if the rcdshift were indeed 
1.96. Thus we note that the redshift may well be 
lower. 7C 1756+6520 had a provisional redshift of 
1.48, and its K magnitude is just consistent with 
this, although somewhat fainter than the mean 
K - z relation at this redshift. 7C 1814+6702 
with a provisional redshift of 4.05 has a K mag- 
nitude in both the IRTF and Lick data consistent 
with both z ~ 4 and 1.2 < 2; < 1.8, although 
its diffuse and possibly aligned structure is typi- 
cal of z > 3 objects (van Breugel et al. 1998) so 
has been kept at 4.05 pending a deeper spectrum. 
7C 1816+6710 was observed in _ff-band by Lacy 
et al. (1999a), its magnitude is consistent with 
its provisional redshift of 0.92. Only one object, 
7C 1820+6657, was undetected in our study; its 
faint magnitude is consistent with its provisional 
redshift of 2.98. 

In summary, we now have photometry on all 
nine of the high redshift radio galaxies with ei- 
ther no redshift information or uncertain redshifts 
in the 7C-III sample of 54 objects. All six objects 
with uncertain redshifts have near-infrared magni- 
tudes consistent with their tentative spectroscopic 
redshifts. Of the three objects with no redshift 



information, two have photometry consistent with 
them having redshifts ~ 1.5, and the one remain- 
ing object (70 1748+6703) is probably at z ~ 3. 

3.3. Emission line contamination of broad- 
band magnitudes 

Line contamination of continuum magnitudes 

is expected to be small in nearly all the 7C ob- 
jects, as the correlation of emission line and ra- 
dio luminosities means that, on average, emission 
line fluxes will be much less than those from ob- 
jects in radio-brighter samples. Whereas the z ^ 2 
objects studied by Eales & Rawlings (1996) typi- 
cally have line contaminations of a few tens of per- 
cent, the 70 objects, which are selected at radio 
fluxes about four to five times lower, should have 
line contaminations ;5 10 %. We have listed in 
Table 2 the estimated percentage contribution to 
the observed flux for all objects with bright emis- 
sion lines in the observed band. These are based 
on the spectroscopy of Lacy et al. (1999b), using 
the median LyQ:/Ha+[Nll] ratio of 0.7 in Eales 
& Rawlings (1996), the line ratios in the com- 
posite radio galaxy spectrum of McOarthy (1993) 
and the Ha/H/3 ratios of Koski (1978) to estimate 
the strengths of lines falling in the observed near- 
infrared. As expected, most objects have emission 
line contributions of < 10 % . For only one object, 
70 1802+6456, is the emission line contamination 
expected to be > 20 % . 

3.4. Estimation of half-light radii 

The seeing during most of our IRTF observa- 
tions was very good, and it was clear from looking 
at the images that there was a wide range in the 
scale sizes of the hosts. We therefore selected a 
subsample consisting of the 15 sources with grade 
a or /3 redshifts in the range 0.8 < z < 2.7 in 
Lacy et al. (1999b) for which we had IRTF im- 
ages. Although not complete, this sample should 
be representative of the 70 objects in this redshift 
range. To estimate the scale sizes we convolved 
model elliptical (de Vaucouleurs) galaxies having 
a range in half-hght radius rhi from 0.1 to 6.7 arc- 
sec and zero ellipticity with a PSF from a nearby 
star in each IRTF image. The model was sub- 
tracted from the data and the minimum in the 
sum of the squares of the residuals found, along 
with the approximate ±lcr range. An example of 
this process is shown in Fig. 2. The results were 
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also checked by eye to sec that the fits were rea- 
sonable. Nearby companion objects and discrete 
sub-components were subtracted prior to the fits. 
Although this is far from a full host galaxy model 
(in particular we made no attempt to discriminate 
between disc and de Vaucoulours fits), we feel that 
it should give a fair estimate of the scale size, and 
was all that the signahnoise in the images typically 
justified. 

To obtain a major axis scale size to compare 
to other work, we need to assume a mean elliptic- 
ity, e, and correct our scale sizes with this. Roche 
et al. and Govoni et al. (2000) derive very similar 
mean values of e « 0.2 for the 6C radio galaxies 
at 2; ^ 1 and nearby radio galaxy hosts respec- 
tively, so this value has been assumed to correct 
the mean scale sizes quoted in Section 4 by multi- 
plying the scale size by the square root of the axial 
ratio corresponding to e = 0.2, namely 1.06. 

For two objects we have been able to compare 
the scale sizes with those measured on images 
taken with the Hubble Space Telescope (HST). 
7C 1758+6719 (z = 2.70) was observed with 
WFPC2 through the F702W filter; details of the 
observations will be given in a future paper (Lacy 
et al. in preparation). For this rest-frame UV im- 
age an exponential disk with rhi = 0.21 arcsec was 
found to be a good fit to the radial profile. This 
can be compared with the estimate of rhi from the 
IRTF image of 0.35 arcsec with a la range of 0.12 
- 1.0 arcsec, which we consider fair agreement. 
7C 1754-^6420 (z = 1.09) was observed through 
the F675W filter (Ridgway & Lacy in prepara- 
tion) . Again, the rest-frame UV emission was bet- 
ter fit by a disk than a dc Vaucoulcurs profile. 
In this case the scale size came out significantly 
smaller than the estimate from the IRTF image, 
0.7 arcsec in the HST image compared to a best 
fit of 4.5 arcsec and a range of 1.6 - 13 arcsec for 
the IRTF image. This was despite the subtrac- 
tion of a nearby companion to the south which 
was successfully removed from both the HST and 
IRTF images. The cause of this discrepancy is not 
clear. This image had the worst seeing of any of 
our IRTF images (1.1 arcsec), but the scale size 
measured in the IRTF image is much larger than 
the seeing HWHM. The radial profile of the IRTF 
image is, however, not at all well fit by the disk 
model from the HST image. Therefore perhaps 
the most likely cause of this discrepancy is that 



the galaxy has a UV-bright disk component em- 
bedded in a much larger scale-size elliptical. 

3.5. Other data from the literature 

To increase the number of high redshift objects 
in our study we have added the results of photom- 
etry of ^ ~ 2 — 5 radio galaxies by van Breugel 
et al. (1998). We also estimated scale sizes from 
their figure 2 for objects which appeared to be 
dynamically-relaxed ellipticals (seven out of the 
eight of their radio galaxies with z < 3). We 
applied k- and aperture corrections in a consis- 
tent manner to that for the 7C-III radio sources. 
We have also added those 1.6 < z < 2.4 objects 
without a strong point source contribution from 
the NICMOS study of radio galaxies by Pentericci 
(1999; and Pentericci et al. 2000), including the 
estimates of scale sizes for the five out of the nine 
objects in this redshift range for which Pentericci 
considers it possible to fit meaningful scale sizes. 
We also include the low luminosity z = 4.42 ra- 
dio galaxy VLA 123642-^621331 discovered in the 
flanks of the Hubble Deep Field (Waddington et 
al. 1999) and 53W002 at z = 2.239 (Windhorst, 
Mathis & Keel 1992). 

The resulting high redshift (HZ) sample is de- 
tailed in Table 3. It mostly contains objects of ra- 
dio luminosity comparable to the 3C radio galax- 
ies at z ~ 1. As these objects (particularly those 
at z > 3) were generally observed outside of rest- 
frame i?-band, and are mostly not from complete 
samples, the results including the HZ sample will 
not be as reliable, though as we shall see the same 
trends seem to exist with or without this sample. 

4. Discussion 

4.1. Radio-luminosity and redshift depen- 
dence of absolute magnitudes 

In Fig. 3 we plot the rest-frame ii-band mag- 
nitudes against redshift for several samples of ra- 
dio galaxies. We plot the z > 0.8 7C-iii objects 
with spectroscopic redshifts and infrared imaging, 
z > 0.8 6C radio galaxies with photometry from 
Bales et al. (1997), z > 0.8 3C radio galaxies 
in the Laing, Riley & Longair (1983; LRL) com- 
plete sample with photometry from Best, Longair 
& Rottgering (1998) [apart from 3C22 which is 
a lightly-reddened quasar (Rawlings et al. 1995) 
and therefore excluded from the sample, 3C 175.1 
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Table 3 

The "HZ"' sample of other hich redshift radio galaxies from the literature 







z 




Mag. 


Mr 






rci. 


6C 0140+326 


4, 


.41 


8 


i^« = 


-- 20.0 


-25, 


.5 




10 


1 


MRC 0152-209 


1 


.89 


4 


iJ = 


18.7 


-25, 


.3 


0.2 


6 


2 


MRC 0156-252 


2 


.09 


4 


H = 


18.4 


-25, 


.9 




5 


2 


USS 0211-122 


2 


.34 


4 


H = 


19.6 


-25, 


.0 


0.73 


3 


2 


MRC 0324-228 


1 


.89 


4 


H = 


19.9 


-24, 


.1 







2 


4C 60.07 


3 


.79 


8 


k' = 


= 19.3 


-25, 


.4 







1 


4C 41.17 


3 


.80 


8 


Ks = 


-- 19.2 


-25, 


.5 







1 


B3 0744+464 


2, 


.926 


8 


Ks = 


= 18.5 


-25, 


.5 


0.92 





1 


MRC 0943-242 


2, 


.922 


8 


K = 


19.2 


-24, 


.8 


0.70 





1 


MG 1019+0534 


2 


.765 


8 


K = 


19.1 


-24, 


.7 


0.97 





1 


3C 257 


2, 


.474 


8 


K = 


17.8 


-25, 


.7 


1.55 


35 


1 


VLA 123642+621331 


4, 


.424 


1.5 


K = 


21.4 


-24, 


.8 


0.29 





3 


4C 1243+036 


3 


.581 


8 


K = 


19.3 


-25, 


.2 







1 


USS 1410-001 


2, 


.33 


4 


H = 


19.3 


-25, 


.4 




9 


2 


8C 1435+635 


4, 


.251 


8 


K = 


19.5 


-25, 


.7 







1 


MRC 1707+105 


2 


.35 


4 


H = 


18.3 


-24, 


.5 


1.62 





2 


53W002 


2, 


.239 




K = 


19.2^ 


-24, 


.1 


1.1 


20 


4 


MRC 2025-218 


2, 


.630 


8 


k' = 


= 18.5 


-25, 


.2 


0.64 





1 


MRC 2048-272 


2 


.06 


4 


H = 


20.2 


-24, 


.1 


0.64 





2 


MG 2121+1839 


1 


.861 


8 


K = 


18.7 


-24, 


.1 


3.07 





1 


MG 2144+1929 


3 


.594 


8 


k' = 


= 19.2 


-25, 


.3 







1 


TX 2202+128 


2, 


.704 


8 


K = 


18.4 


-25, 


.4 


0.19 





1 


MRC 2224-273 


1, 


.68 


4 


H = 


19.2 


-24, 


.4 


0.2 


28 


2 


4C28.58 


2 


.905 


8 


K = 


18.7 


-25, 


.3 







1 



''Total magnitude 

Note.— References: (1) van Breugel et al. (1998); (2) Pentericci (1999); (3) 
Waddington et al. (1999); (4) Windhorst et al. (1992) 



11 



(z — 0.92) which has photometry from Ridgway & 
Stockton (1997) and 3C 263.1 (z = 0.824) which 
has photometry from Bales (personal communi- 
cation)]. We have also added local radio galax- 
ies in LRL from Owen & Laing (1989) and the 
HZ sample of galaxies of Table 3. All these ob- 
jects, with the exception of three FRIs out of the 
24 objects in the Owen & Laing LRL sample are 
either FRII or compact steep-spectrum sources. 
All the z > 0.8 7C-III somces at z > 0.8 arc 
well above the FRI/FRII boundary in radio lu- 
minosity, with 1/^(151) > 10^^WHz~^sr~^ com- 
pared to the FRI/FRII boundary at iii(i5i) ^ 
1025WHz-isr-i. 

There is a clear trend for redshift and abso- 
lute magnitude to correlate, even if the incom- 
plete HZ sample is excluded. The 3C, 6C and 
7C samples are all complete samples, selected on 
the basis of low frequency radio flux only, and 
are nearly completely identified. Thus the only 
selection effect which needs to be considered for 
these samples is the tendency for redshift and 
luminosity to correlate within each flux limited 
sample. With the wide range in radio luminosi- 
ties in the complete samples at z ~ 1, however, 
we can separate out the luminosity dependence. 
This is illustrated in Fig. 4, where we have plot- 
ted absolute magnitude against radio luminosity 
for 3C, 6C and 7C galaxies in the redshift range 
0.8 < z < 1.4. In this redshift range the mean 
magnitude in 3C is Mn = —24.17 ± 0.13, whereas 
in 6C it is -23.79±0.14 and in 7C-III -23.78±0.15. 
This suggests that only the most radio-luminous 
objects have slightly brighter (by w 0.4 mag) hosts 
[see also Rawlings et al. (1998) where preliminary 
K-hand photometry on the 7C-I and 7C-II samples 
is presented] . 

We can think of two possible explanations for 
this radio luminosity dependence of host mag- 
nitude at z ~ 1. The first is a contribution 
from AGN-related light, e.g. emission lines and 
reddened and/or scattered quasar light (Bales & 
Rawlings 1996). Rigler et al. (1992), however, 
show that the fraction of emission from z ~ 1 3C 
radio galaxies which is aligned with the radio jet 
axis, and therefore probably closely related to the 
AGN activity, contributes only f» 10% of the light 
in the observed K-hand. Also, Simpson, Rawlings 
& Lacy (1999) argue that reddened quasar light 
is responsible for < 10% of the observed near- 



infrared emission from 3C radio galaxies on the 
basis of 3/um imaging. Thus we believe that only 
~ 10 — 20 per cent of the host luminosity can be 
accounted for by the AGN, not the w 40 per cent 
required to explain the higher host luminosities of 
the 3C galaxies. The second possibility is that the 
radio luminosity may be correlated with the mass 
of the host. This could arise, for example, if cor- 
relation of black hole mass and the mass of the 
spheroidal component of galaxies which is claimed 
to be present at low redshift (e.g. Magorrian et al. 

1998) is appearing at the highest radio luminosi- 
ties (Roche et al. 1998). We have argued (Lacy, 
Ridgway & Trentham 2000; see also Willott et al. 

1999) that most radio galaxies are sub-Bddington 
accretors, but at the highest luminosities corre- 
sponding to the highest black hole masses, even 
the radio galaxies may be accreting at near Ed- 
dington rates. Thus at these highest luminosities 
we might expect a host galaxy mass - AGN lumi- 
nosity (and therefore host luminosity) correlation 
to appear. Also, the mass of the host may corre- 
late with the density of the intergalactic medium 
confining the radio source, which would enhance 
the radio luminosity. 

We have made a crude model of the luminos- 
ity dependence by assuming a contribution from 
AGN-related light to a base host optical luminos- 
ity Lq. We model this contribution as a combina- 
tion of a component due to emission lines and an- 
other component due to other AGN-related light. 
(We separate the emission line contribution, as the 
equivalent width of the emission lines is a strong 
function of redshift, unlike the other AGN-related 
emission which we expect to be much less red- 
shift dependent.) Both these components should 
be proportional to the AGN luminosity, which we 
assume to be proportional to Lji{151) to the power 
0.8 (Serjeant et al. 1998; Willott et al. 1999), i.e. 

Lhost(2) =io(2) + (a + /3(^))i?f'i5i), (1) 

where ^-^^{(isi) contribution of AGN-related 

light other than emission lines and /3(z)L5ffi5i) is 
the contribution due to emission lines. The base 
host luminosity at z « 1 was set to Lo{l) = 
1023WHz-\ or Mr » -23.7, close to the mean 
of the 7C and 6C data, and (a -I- /3(1)) fit by eye 
to be « 3.5W°-2Hz-o-2 (Fig. 4). The emission line 
contribution to the total near-infrared luminosities 
of the 30 radio galaxies at these redshifts is ~ 10% 
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Fig. 3. — The rest-frame -R-band absolute mag- 
nitudes. Tlio valtic of M]S. locally is indicated by 
the dotted line. A typical error bar for the 7C 
objects is indicated; the other samples generally 
have smaller error bars. 
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Fig. 4. The luminosity dependence of host 
galaxy magnitude at z ~ 1. The line represents 
the model of equation (1). 



(Rawlings et al. 1997; Rawlings Eales & Lacy 
1990), or about 20% of the overall luminosity- 
dependent correction. We have therefore set a = 
2.8WO-2Hz-o-2 and /3(1) = 0.7 WO-^Hz-O-^. 

Using this formula we can then correct the host 
magnitudes for AGN-related light and produce 
plot of Lo versus redshift. [Rather than attempt to 
model the redshift dependence of /3 we have used 
emission line contamination estimates from Eales 
& Rawhngs (1997) and van Breugel et al. (1998) 
to explicitly correct the magnitudes in the 6C and 
high redshift samples (where these were unknown 
the object was omitted from the plot and corre- 
lation analysis), and our own estimates to correct 
the 7C-III magnitudes.] The results are shown in 
Fig. 5, which shows that there is indeed a cor- 
relation of Lq with redshift. This was confirmed 
using the Kendall Tau statistic generalized to in- 
clude limits, as implemented in the IRAF.STSDAS 
program bhkmethod (Isobe & Feigelson 1990), 
which returned a r = 0.52 for the 106 objects from 
the 3C, 6C, 7C and HZ samples after correction. 
This corresponds to a probability that there is no 
correlation between Lq and redshift of < 0.0001. 
However, this probability increases to 0.13 if the 
HZ sample is removed. We have also conducted 
the same statistical test before correction for radio 
luminosity-dependent effects in equation (1). This 
gave a r = 0.95 and a probability of no correla- 
tion of < 0.0001, removing the HZ sample reduced 
this to T = 0.66, again with the probability of no 
correlation of < 0.0001. 

4.2. The dispersion in the host luminosi- 
ties 

The low scatter in the magnitudes of radio 
galaxies in the K — z Hubble Diagram out to z Z 3 
has long been tised as an argtiment for a high red- 
shift of radio galaxy formation (e.g. Lilly 1989). 
This low scatter is thought to come about because 
radio galaxies form at z > 3 then evolve along sim- 
ilar passive evolution tracks to end at radio galaxy 
hosts today. With a few modifications this simple 
picture still seems to be basically valid. Fig. 3 
and Table 4 show that the dispersion in absolute 
magnitudes to z ~ 4 is indeed very low. After 
correction for AGN-contamination, however, the 
dispersion in Lq for the z > 1.8 objects seems to 
be higher than for the uncorrected objects (Table 
4). As discussed above, the correlation of absolute 
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magnitude with redshift is also less tight after cor- 
rection. 

At first sight it seems paradoxical that cor- 
rection for luminosity-dependent effects should 
increase the scatter in the absolute magnitudes. 
However, this can be be understood if the luminosity- 
dependent contributions {a + /3)ifl^i5i) are ap- 
proximately equal to the highest values of Lq. 
(The ^; > 1.8 objects have a very narrow range 
in Ln as the range in flux of these objects is low, 
all except VLA 123642-1-621331 having 151 MHz 
fluxes between 0.5 and 10 Jy.) The addition of 
the luminosity-dependent c;ontributions can then 
boost the objects with low Lq by a significant fac- 
tor, whereas the addition to the objects with high 
Lq results in only a relatively small fractional in- 
crease in the luminosity (see also de Vries 1999). 
The result of this is a reduction of the scatter in 
the total luminosities. 

We have used the F-test for variances to ex- 
amine the statistical significance of the increase in 
scatter of the Lq values. Comparing the objects 
with 0.8 < z < 1.8 to those with 1.8 < z < 2.8, 
we find that the increase in the sample standard 
deviation from ct„_i = 0.53 to cr„_i = 0.93 is sig- 
nificant at the 0.1 % level. At z > 2.8, our sample 
contains two upper limits on the magnitudes, so 
we give a lower limit to the scatter of cr„_i > 0.97. 
This is again significantly higher than that in the 
range 0.8 < z < 1.8 (at the 0.2 % level). Thus 
there is good evidence that the scatter in Lq is 
increasing with redshift, suggesting that we are 
close to the formation epoch of at least a signifi- 
cant fraction of the radio galaxy population. 
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Fig. 5. — The redshift dependence of radio galaxy 
absolute magnitude after approximate correction 
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Table 4: Scatter in radio galaxy absolute magnitudes as a function of redshift 



RcdHhift 




Mean 


Mr 




b 

-1 






uncorrected 


corrected 


luieorrtX'tiHl 


corrected 


0.8 < z < 1.8 


44 


-24.0 


-23.6 


0.5 


0.5 


1.8 < 2 < 2.8 


25 


-24.6 


-24.2 


0.7 


0.9 


z > 2.8 


15 


>-24.3 


>-25.1 


>0.7 


>1.0 



"Number of objects in each sub-sample 
''Standard deviation of the sample. 

Note. — Mji is measured in a 63.9 kpc metric aperture. Uncorrected magnitudes have fe-corrections only, no corrections for 
emission line or AGN luminosity have been applied. Corrected magnitudes have, in addition to fe-corrections, emission-line 
contamination corrections and a correction for AGN luminosity as detailed in Section 4.1. 
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4.3. The scale sizes of the hosts 



Roche ct al. (1998) have shown that the scale 
sizes of the 1 < 2; < 1.4 6C galaxies are signifi- 
cantly smaller than their more radio luminous 3C 
counterparts as measured by Best et al. (1997), 
suggesting a strong dependence of scale size on 
AGN luminosity. In this section we re-examine 
this using our data on the 7C objects, and further 
data on 3C scale sizes published recently. 

To compare with the 6C results, we have de- 
fined complete samples of 7C-III and 3C radio 
galaxies from LRL in the redshift range 0.8 < 2; < 
1.4 (we took the lower limit of 0.8 so as to include 
more 3C and 7C objects; only two 6C objects he in 
the redshift range 0.8 < 2; < 1, and we do not feel 
their exclusion will significantly affect the results). 
For the 3C objects we have used size estimates 
from the 2D modelhng of McLurc & Dunlop (2000) 
or the Keck /C-band images of Ridgway & Stock- 
ton (1997) where available, and otherwise from 
Best et al. (1997). No estimates were available 
for three of the sixteen objects in the 3C sample, 
and we have omitted 3C356 due; to confusion over 
the true identification. One of the objects missing 
from the 3C sample were omitted from the sample 
of Best et al. (1997) (3C 263.1), and the other two, 
3C 13 and 3C 368 have K-h&nd structures which 
are clearly affected by AGN-related emission as 
they are closely aligned with the radio axes. Of 
the 7C-III objects, only one, 7C 1816-h6710, has no 
scale size estimate as its image was taken in poor 
seeing, although the scale size of 7C 1742-1-6346 is 
an upper limit only. All the 7C scale sizes were 
increased by a factor of 1.06 to correct for an ex- 
pected mean ellipticity of 0.2. 

In Fig. 6 we plot the scale sizes of the 3C, 6C 
and 7C radio galaxies in the samples defined above 
against host absolute magnitude. Although the 
scatter is large it does seem that the 7C-III ra- 
dio galaxies have scale sizes consistent with their 
magnitudes when compared to the 3C radio galax- 
ies, although the 6C objects are plotting below the 
general trend. The mean scale size for the 3C sam- 
ple is 11.1 ± 1.4 kpc and that for the 6C sample 
4.7 ± 0.9kpc. For the 7C-III sample, wc find a me- 
dian scale size of 0.9 ± 0.4 arcscc, which, when 
corrected for a mean ellipticity of 0.2, corresponds 
to about 8 ± 3 kpc. (Using the whole subset of 
7C-III sources in the range 0.8 < z < 2.7 with 



1.5 



o 



0.5 



X LRL, 0.8<z<1.4 

o 6C, l<z<1.4 

■» 7C-IIL 0.8<z<1.4 



-22 



-23 



-24 



-25 



-26 



Fig. 6. — The dependence of half-light radius on 
host absolute magnitudes. Errors are shown for 
the 7C objects only. The dot-dashed line rep- 
resents the correlation observed for local ellipti- 
cals (equation (2) with AB. = 0); the dashed line 
the same relationship corrected to 2; ~ 1 using 
the mean properties of the 3C hosts (i.e. with 
AR = 0.68). 
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measured scale sizes we obtained the same result.) 
The mean for the 7C-III is thus between that of 
the 6C and 3C samples, and statistically consis- 
tent with both. 

A further check is provided by the effective- 
radius - galaxy magnitude relationship discussed 
by Roche et al. (1998), 

lg(rw/kpc) = -0.3(Mfl + AR + 20.01), (2) 
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Fig. 7. — Half-light radius in arcseconds versus 
redshift. The lines show the angular size of a stan- 
dard rod of length equal to the mean of the half- 
light radii of local FRII host galaxies (13 kpc) as 
a function of redshift in an Qm = 1 , ^^A = cos- 
mology (dotted line), an Qm = 0.3, Oa = 0.7 cos- 
mology (long-dashed line) and an Qm = 0, r^A = 
cosmology (short-dashed line). 



where AR is the amount by which the hosts have 
brightened by passive evolution from z = 0. To set 
AR, we used the mean scale size of the 3C galaxies 
in our 0.8 < z < 1.4 sample of 11.1 kpc, and 
the mean magnitudes of the same objects using 
the photometry of Best ct al. (1998). This gave 
AR, = 0.68. We have plotted the relationship of 
equation (2) on Fig. 6, for both AR = and AR — 
0.68. The 7C radio galaxies arc consistent with 
this relation for AR = 0.68 (within the scatter), 
but most of the 6C radio galaxies plot significantly 
below the line. 

We therefore believe that a combination of poor 
seeing in both the Roche et al. (1998) and Best et 
al. (1998) studies, which lead to poor scale size es- 
timates, combined with small number statistics in 
the Roche et al. study, can probably explain the 
result of Roche et al. without the requirement for 
scale-size to depend very strongly on radio lumi- 
nosity, though clearly better images of both the 
6C and 7C objects will be needed to be certain of 
this. 

We have also investigated possible redshift de- 
pendences of scale size with the addition of data 
on objects in the HZ sample listed in Table 3. Fig. 
7 shows the results of plotting the scale size in arc- 
sec against redshift. The mean half-light radius for 
local hosts of classical double FRIIs from Owen & 
Laing (1989) and Govoni et al. (2000), » 13 ± 2.5 
kpc, is also plotted line in this figure for three 
different cosmologies. There is some evidence for 
a weak trend for scale size to decrease with in- 
creasing redshift, but with a lot of scatter. In an 
attempt to reduce the scatter we have tried using 
a correction based on equation (1), by introducing 
a correction in the log of the scale size: 



Algrhi = -0.3(M-Mo(z)), 



where M is the absolute i?-band magnitude af- 
ter correction for line contamination, and Mo{z) is 
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Lq{z) converted to absolute magnitude. With this 
correction, using the 7C, 6C, 3C (0.8 < z < 1.4) 
and HZ samples there is a weak anticorrelation of 
scale size with rcdshift, significant at about the 
5% level using the bhkmethod. 

Inspection of Fig. 7, and the comparison to low 
redshift FRII hosts, shows that the scatter in scale 
sizes is large at all redshifts, and suggests that any 
evolution in scale size is weak. FRI hosts at low 
redshift, however, have significantly brighter hosts 
and larger scale sizes at a given radio luminosity 
(e.g. Owen & Ledlow 1994), suggesting that merg- 
ing in the richer cluster environments associated 
with these objects may be significant if they are 
the descendents of more luminous sources at high 
redshift. An argument against this is provided by 
a recent study of brightest cluster galaxies (BCGs) 
by Burke, Cohins & Mann (2000), who show that, 
when selection effects are properly taken into ac- 
count, the BCGs of the most X-ray luminous clus- 
ters brighten with redshift (to a similar extent as 
the FRII hosts, in fact), and thus there is little ev- 
idence of significant merger activity for redshifts 
< 2 < 1. However, because the FRII hosts 
are significantly less luminous than BCGs, merg- 
ers that would enhance the luminosity of a BCG 
by only a few percent would enhance that of an 
FRII host by a much larger factor. Nevertheless, 
the lack of evidence for merger activity in even 
rich clusters suggests that FRII hosts, which are 
generally found in poor cluster environments (e.g. 
Wold et al. 2000), are probably not significantly 
affected by merging, at least at ^; < 1. 

4.4. Surface brightness evolution 

To demonstrate that the redshift dependence 
of host magnitude is independent of the assumed 
cosmology we have plotted in Fig. 8 the effective 
surface brightness (defined here as the fiux within 
rhi divided by Trr^ converted into magnitudes) 
in the rest-frame _R-band versus rcdshift for the 
sub-sample of 7C-iii galaxies with estimated scale 
sizes. We have compared this to the cosmological 
surface brightness dimming expected in a standard 
expanding Universe with a Friedmann-Robertson- 
Walker metric. The surface brightness decreases 
with rcdshift much less rapdly than the prediction 
for a non-evolving stellar population (solid line in 
Fig. 8). This demonstrates that the brightening 
of the hosts with redshift is a real evolutionary ef- 
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Fig. 8. — The effective surface brightness as a 
function of redshift for the 0.8 < z < 2.7 7C-III 
galaxies observed with the IRTF, after correction 
for emission line contamination. The crosses are 
the mean values of the surface brightness in the 
redshift range indicated by the horizontal bars and 
with an error in the mean indicated by the vertical 
bar. The mean local value for FRII radio galax- 
ies from Govoni et al. (2000) and its expected be- 
haviour with redshift is indicated by the solid line, 
with the dashed lines showing the ±la range. 
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feet, and not one produced by an incorrect choice 
of f^M and f^A- 

5. The evolution of radio galaxy hosts 

The evolution of host magnitude and rest-frame 
surface brightness with redshift, the lack of strong 
evolution in scale sizes in the subset of objects with 
"undisturbed" morphologies, and the low scatter 
(< 1 magnitude) in the absolute magnitudes up 
to at least z ^ 3 are all consistent with an early 
formation epoch (z 3) for most, and perhaps all, 
radio galaxy hosts. At first sight these results are 
in conflict with standard CDM-based models for 
galaxy formation, in which hierarchical structure 
formation occurs in a "bottom up" fashion with 
the most massive galaxies forming late. Radio 
galaxies at 2 ~ 2 are much brighter than the aver- 
age quasar host predicted by the models of Kauff- 
mann & Haehnelt (2000), for example. To com- 
pare our radio galaxy sample to these predictions 
we have assumed an equivalent quasar luminosity 
equal to the mean of the 1 < 2; < 3 7C quasars, 
Mb « -25 (Willott et al. 1998). For quasars of 
this luminosity, the Kauffmann & Haehnelt mod- 
els predict a mean host magnitude of My « —21.5 
compared to our radio galaxies with a luminosity- 
corrected median Mr » —23.5 (corresponding to 
My ~ —23). In contrast, the hosts of radio-quiet 
quasars at z ^ 2 seem to fit this model quite well 
(Ridgway ct al. 2000). 

Between z ~ 1.3 and z ~ 2.3, there is evidence 
for an increasing scatter in base host luminosity. 
This suggests we are close to the formation epoch 
at z ~ 2.3. There are two other pieces of evi- 
dence which point to strong evolution in the host 
galaxies and which appear at z > 2.5. First, the 
observations of Pentericci (1999; and Pentericci 
et al. 2000) and van Breugel et al. (1998) show 
that the morphologies of radio galaxy hosts at 
z < 2.5 are mostly relaxed ellipticals, whereas at 
higher rcdshifts (z > 2.5 in the HST NICMOS 
H-band images of Pentericci et al. and z > 3 in 
the ground-based K-hand images of van Breugel 
et al.) hosts with clumpy structures aligned with 
the radio source axis become much more com- 
mon^. Second, the detection rate of continuum 

■^van Breugel et al. argue that this is not simply due to the 
2: > 3 objects being observed at shorter rest-frame wave- 
lengths, as lower redshift objects observed at the same rest- 



submillimcter emission from hot dust seems to rise 
rapidly at z > 2.5 (Archibald et al. 2000). The 
6C and 7C samples are completely identified with 
luminous host galaxies out to at least z « 3, how- 
ever, which would suggest the bulk of the stars 
had formed and/or merged onto the host by z ~ 3. 
One way around a high formation redshift for all 
objects would be to assume a systematic delay be- 
tween a host forming the bulk of its stars and the 
switching on of powerful radio jets (for example, if 
the black hole needs time to accrete enough mass) . 
Radio galaxies would then always be seen with lu- 
minous hosts. However, the strong evolution in 
the hosts seen by van Breugel et al. around z ~ 3 
suggests that their nature is still changing up to 
at least that epoch. 

Taken together, the evidence suggests that the 
epoch of radio galaxy host formation was essen- 
tially over by z ~ 3, in the sense that most objects 
had formed the bulk of their stars, although fur- 
ther star-formation and merger activity continued 
in many objects up to z 2. As radio galaxies can 
be found out to z > 5, however (van Breugel et al. 
1999), and there is no sign of a substantial drop 
in the space density of radio galaxies out to z > 4 
relative to their peak space density at z ~ 2.5 
(Jarvis et al. 1999), this formation epoch must 
last at least a Gyr. In contrast, field ellipticals 
with evolved stellar populations and luminosities 
~ seem to become rarer at z > 1 (Zcpf 1998; 
Barger et al. 1999; Dickinson 1999), suggesting a 
later formation epoch, perhaps z ^ 1 — 2. 

A recent variation on the hierarchical models, 
the so-called Anti-Hierarchical Baryonic Collapse 
Model (Granato et al. 2000) may be able to ex- 
plain these observations. This predicts that the 
massive hosts of powerful AGN may have formed 
at z ^ 3. This model tises the same CDM as- 
sumptions as the standard scenarios, but assumes 
that the dense gas in the most massive haloes can 
collapse and form stars in a massive spheroidal 
component more rapidly than that in the smaller 
haloes associated with normal galaxies. Star for- 
mation is switched off in these objects by winds 
from the AGN and the starburst at high redshift. 
If radio galaxy hosts are indeed typical of the 
most massive elliptical galaxies and their progeni- 

frame wavelengths as the z > 3 objects still look like relaxed 
ellipticals. 
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tors this model should be applicable to them, and 
would explain their anomalous evolution relative 
to generally less luminous field ellipticals. 
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